Supporting Information Extraction from Visual Documents
نویسندگان
چکیده
منابع مشابه
Supporting Information Extraction from Visual Documents
Visual Information Extraction (VIE) is a technique that enables users to perform information extraction from visual documents driven by the visual appearance and the spatial relations occurring among the elements in the document. In particular, the extractions are expressed through a query language similar to the well known SQL. To further reduce the human effort in the extraction task, in this...
متن کاملSupporting Visual Information Extraction from Geospatial Data
The Spatial Relation Query (SRQ) tool is a graphical software system, supported by a SQL-like query language, that enables users to perform information extraction driven by the visual appearance and the spatial arrangement of the information. The tool has been initially designed to support visual information extraction from web pages. Indeed, its former underlying spatial relation formalism rel...
متن کاملExtraction of Temporal Information from Documents
Temporal information in the document is a demanding dimension to be discovered. Recently, many works from board research areas such as database, information retrieval, text mining pay attention to a temporal aspect. In this paper, we give a survey of state-of-art in extracting temporal information from document collections. As it is quite a new discipline, there is no standard comparison scheme...
متن کاملTracking Information Extraction from Intelligence Documents
We describe here some of the research underlying the development of KANI (Knowledge Associates for Novel Intelligence), a hybrid system that combines large scale information extraction (IE) with knowledge representation (KR). The combination of these two technologies raises numerous research problems, such as an evaluation and understanding of the requirements that KR puts on IE and vice-versa,...
متن کاملInformation Extraction from Online XML-encoded Documents
Online reference documents tend to be semi-formatted in that they contain repeated sections with similar structure, and have free-text inside each section. XML (extensible markup language) enables document designers to design rich tag sets where tags for section headings contain information about each section. This contextual information, coupled with the fact that the free-text sections of the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Computer and Communications
سال: 2016
ISSN: 2327-5219,2327-5227
DOI: 10.4236/jcc.2016.46004